Cluster Tendency Assessment for Fuzzy Clustering of Incomplete Data
نویسندگان
چکیده
The quality of results for partitioning clustering algorithms depends on the assumption made on the number of clusters presented in the data set. Applying clustering methods on real data missing values turn out to be an additional challenging problem for clustering algorithms. Fuzzy clustering approaches adapted to incomplete data perform well for a given number of clusters. In this study, we analyse different cluster validity functions in terms of applicability on incomplete data on the one hand. On the other hand we analyse in experiments on several data sets to what extent the clustering results produced by fuzzy clustering methods for incomplete data reflect the distribution structure of data.
منابع مشابه
A Fuzzy C-means Algorithm for Clustering Fuzzy Data and Its Application in Clustering Incomplete Data
The fuzzy c-means clustering algorithm is a useful tool for clustering; but it is convenient only for crisp complete data. In this article, an enhancement of the algorithm is proposed which is suitable for clustering trapezoidal fuzzy data. A linear ranking function is used to define a distance for trapezoidal fuzzy data. Then, as an application, a method based on the proposed algorithm is pres...
متن کاملClustering of Fuzzy Data Sets Based on Particle Swarm Optimization With Fuzzy Cluster Centers
In current study, a particle swarm clustering method is suggested for clustering triangular fuzzy data. This clustering method can find fuzzy cluster centers in the proposed method, where fuzzy cluster centers contain more points from the corresponding cluster, the higher clustering accuracy. Also, triangular fuzzy numbers are utilized to demonstrate uncertain data. To compare triangular fuzzy ...
متن کاملDirect Marketing Based on Fuzzy Clustering of Customers (Case Study: on one Mobile Company)
Objective There is a general tendency toward direct marketing these days. Therefore, instead of designing advertisement and marketing strategies for all the customers in the market, it is recommended to classify the customers based on clustering techniques and then design specific strategies accordingly. This will reduce marketing and advertisement expenses, increase sale department efficientl...
متن کاملDealing with Incomplete Data in Clustering
Over the years, significant developments have taken place in the direction of clustering numeric, categorical or mixed data. A new challenge is to cluster data with missing attribute values. The early algorithms used Fuzzy c-means to partition data into fuzzy clusters and estimate the missing values through estimation algorithms. Recently, Hathaway and Bezdek have proposed four strategies for e...
متن کاملClustering Large Data with Mixed Values Using Extended Fuzzy Adaptive Resonance Theory
Clustering is one of the technique or approach in content mining and it is used for grouping similar items. Clustering software datasets with mixed values is a major challenge in clustering applications. The previous work deals with unsupervised feature learning techniques such as k-Means and C-Means which cannot be able to process the mixed type of data. There are several drawbacks in the prev...
متن کامل